Training a Binary Weight Object Detector by Knowledge Transfer for Autonomous Driving

Author: López Antonio M.
Wang Peng
Xu Jiaolong
Yang Heng
Publication date: 25/05/2019

Autonomous driving has harsh requirements of small model size and energy efficiency, in order to enable the embedded system to achieve real-time on-board object detection. Recent deep convolutional neural network based object detectors have achieved state-of-the-art accuracy. However, such models are trained with numerous parameters, and their high computational costs and large storage requirements prohibit deployment to systems with limited memory and computation resources. Low-precision neural networks are popular techniques for reducing the computation requirements and memory footprint. Among them, the binary weight neural network (BWN) is the extreme case, which quantizes the floating-point weights into just 1 bit. BWNs are difficult to train and suffer from accuracy degradation due to the extreme low-bit representation. To address this problem, we propose a knowledge transfer (KT) method to aid the training of BWN using a full-precision teacher network. We build DarkNet- and MobileNet-based binary weight YOLO-v2 detectors and conduct experiments on the KITTI benchmark for car, pedestrian and cyclist detection. The experimental results show that the proposed method maintains high detection accuracy while reducing the model size of DarkNet-YOLO from 257 MB to 8.8 MB and MobileNet-YOLO from 193 MB to 7.9 MB. Comment: Accepted by ICRA 201

arXiv.org e-Print Archive

Crossref

Diposit Digital de Documents de la UAB

Pedestrian Detection at Day/Night Time with Visible and FIR Cameras : A Comparison

Author: Fang Zhijie
González Alejandro
López Antonio M.
Serrat Joan
Socarras Yainuvis
Vázquez David
Xu Jiaolong
Publication venue: 'MDPI AG'
Publication date: 01/01/2016

Other grants: DGT (SPIP2014-01352). Despite all the significant advances in pedestrian detection brought by computer vision for driving assistance, it is still a challenging problem. One reason is the extremely varying lighting conditions under which such a detector should operate, namely day and nighttime. Recent research has shown that the combination of visible and non-visible imaging modalities may increase detection accuracy, where the infrared spectrum plays a critical role. The goal of this paper is to assess the accuracy gain of different pedestrian models (holistic, part-based, patch-based) when training with images in the far infrared spectrum. Specifically, we want to compare detection accuracy on test images recorded at day and nighttime when trained (and tested) using (a) plain color images; (b) just infrared images; and (c) both of them. In order to obtain results for the last item, we propose an early fusion approach to combine features from both modalities. We base the evaluation on a new dataset that we have built for this purpose as well as on the publicly available KAIST multispectral dataset.

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals

PubMed Central

Diposit Digital de Documents de la UAB

AniPortraitGAN: Animatable 3D Portrait Generation from 2D Image Collections

Author: Chen Qifeng
Tong Xin
Wei Fangyun
Wu Yue
Xiang Jianfeng
Xu Sicheng
Yang Jiaolong
Publication date: 05/09/2023

Previous animatable 3D-aware GANs for human generation have primarily focused on either the human head or full body. However, head-only videos are relatively uncommon in real life, and full body generation typically does not deal with facial expression control and still has challenges in generating high-quality results. Towards applicable video avatars, we present an animatable 3D-aware GAN that generates portrait images with controllable facial expression, head pose, and shoulder movements. It is a generative model trained on unstructured 2D image collections without using 3D or video data. For the new task, we base our method on the generative radiance manifold representation and equip it with learnable facial and head-shoulder deformations. A dual-camera rendering and adversarial learning scheme is proposed to improve the quality of the generated faces, which is critical for portrait images. A pose deformation processing network is developed to generate plausible deformations for challenging regions such as long hair. Experiments show that our method, trained on unstructured 2D images, can generate diverse and high-quality 3D portraits with desired control over different properties. Comment: SIGGRAPH Asia 2023. Project Page: https://yuewuhkust.github.io/AniPortraitGAN

arXiv.org e-Print Archive

Uniconazole-induced starch accumulation in the bioenergy crop duckweed (Landoltia punctata) II: transcriptome alterations of pathways involved in carbohydrate metabolism and endogenous hormone crosstalk

Author: A Bastias
A Conesa
A Gomez-Cadenas
A Sakai
AD Pavlista
AG Bayrakci
AM Smith
B Langmead
C Ghiena
C Guan
C Martin
D-L Yang
DJ Mares
DV Dugas
E Giulia
E Landolt
E Ramireddy
EB Blazey
F Rook
G Wan-zhuo
Guohua Zhang
Hai Zhao
HJ Kim
J Li
J Xu
J Yang
JC Oliveros
JE Barrett
JF Tárrago
Jiaolong Sun
JPC To
JS McLaren
Kaize He
KE Hubbard
L Ge
L Zhang
M Reguera
M Reid
M Zhang
MD Robinson
Mengjun Huang
MG Grabherr
P McCombs
PJ Crutzen
Q Chen
R Fletcher
R Leng
RA Fletcher
S Kaur
S Rentzsch
S-Y Park
SA El-Shafai
SM Pilkington
T Akihiro
T Searchinger
T Umezawa
T Werner
TD Davis
W Cui
W Wang
W Zhou
WS Hillman
X Ge
X Ji
X Tao
X Wang
Xiang Tao
Y Nakamura
Y Wang
Y Wu
Y Xiao
Y Zhao
Yang Fang
Yang Liu
Yanling Jin
Yun Zhao
Publication venue: 'Springer Science and Business Media LLC'

Crossref